Homogeneity analysis using absolute deviations

نویسندگان

  • George Michailidis
  • Jan de Leeuw
چکیده

Homogeneity analysis is a technique for making graphical representations of categorical multivariate data sets. Such data sets can also be represented by the adjacency matrix of a bipartite graph. Homogeneity analysis optimizes a weighted least-squares criterion and the optimal graph layout is computed by an alternating least squares algorithm. Heiser Comput. Statist. Data Anal. (1987) 337, looked at homogeneity analysis under a more robust to outliers criterion, namely a weighted least absolute deviations criterion. In this paper, we take an in-depth look at the mathematical structure of this problem and show that the graph drawings are created by reciprocal computation of multivariate medians. Several algorithms for computing the solution are investigated and applications to actual data suggest that the resulting p-dimensional drawings (p¿ 2) are degenerate, in the sense that all object points are clustered in p + 1 locations. We also examine some variations of the criterion used and conclude that the generate solutions observed are a consequence of the normalization constraint employed in this class of problems. c © 2004 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spatial homogeneity and redshift-distance laws.

Spatial homogeneity in the radial direction of low-redshift galaxies is subjected to Kafka-Schmidt V/V(m) tests using well-documented samples. Homogeneity is consistent with the assumption of the Lundmark (quadratic redshift-distance) law, but large deviations from homogeneity are implied by the assumption of the Hubble (linear redshift-distance) law. These deviations are similar to what would ...

متن کامل

A New Model for the Secondary Goal in DEA

The purpose of the current paper is to propose a new model for the secondary goal in DEA by introducing secondary objective function. The proposed new model minimizes the average of the absolute deviations of data points from their median. Similar problem is studied in a related model by Liang et al. (2008), which minimizes the average of the absolute deviations of data points from their mean. ...

متن کامل

Logical Analysis of Multiclass Data with Relaxed Patterns

This paper proposes a relaxed algorithm based on mixed integer linear programming (MILP) to extend the LAD methodology to solve multi-class classification problems, where One-vs-Rest (OvR) learning models are constructed to classify observations in predefined classes. The suggested algorithm has two control parameters, homogeneity and prevalence, for improving the classification accuracy of the...

متن کامل

Regression Model Estimation Using Least Absolute Deviations , Least Squares Deviations and Minimax Absolute Deviations Criteria

Regression models and their statistical analyses is the most important tool used by scientists in data analyses especially for modeling the relationship among random variables and making predictions with higher accuracy. A fundamental problem in the theory of errors, which has drawn attention of leading mathematicians and scientists since past few centuries, was that of fitting functions. For t...

متن کامل

تکرارپذیری نسبی و مطلق آزمون Timed Up and Go در سالمندان ساکن اجتماع و جوانان سالم

Objectives: Relative and absolute reliability are psychometric properties of the test that many clinical decisions are based on them. In many cases, only relative reliability takes into consideration while the absolute reliability is also very important. Methods & Materials: Eleven community-dwelling older adults aged 65 years and older (69.64±3.58) and 20 healthy young in the age ran...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 48  شماره 

صفحات  -

تاریخ انتشار 2005